CMB Polarization at Large Angular Scales: Data Analysis of the POLAR Experiment 
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The coming flood of CMB polarization experiments, spurred by the recent detection of CMB 
polarization by DASI and WMAP, will be confronted by many new analysis tasks specific to polar- 
ization. For the analysis of CMB polarization data sets, the devil is truly in the details. With this 
in mind, we present the details of the data analysis for the POLAR experiment, which recently led 
to the tightest upper limits on the polarization of the Cosmic Microwave Background Radiation at 
large angular scales. We discuss the data selection process, mapmaking algorithms, offset removal, 
and the likelihood analysis which were used to find upper limits on the polarization. Stated using 
the modern convention for reporting CMB Stokes parameters, these limits are 5.0 /iif on both E- 
type and S-type polarization at 95% confidence. Finally, we discuss simulations used to test our 
analysis techniques and to probe the fundamental limitations of the experiment. 
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I. INTRODUCTION 

The detection of polarization in the Cosmic Microwave 
Background (CMB) has been a long sought goal for 
cosmology. CMB polarization was recently detected at 
sub-degree angular scales by the DASI instrument, a 
ground-based interferometer [1], and the temperature- 
polarization correlation has now been detected at larger 
angular scales by the WMAP satellite [2]. While infor- 
mation in the CMB polarization at degree and sub-degree 
angular scales further confirms the standard cosmological 
model, polarization information at larger angular scales 
has the potential to provide additional information re- 
garding the formation and evolution of the universe. The 
process of reionization leaves a characteristic signature 
on CMB polarization at large angular scales which can 
be used as a means to determine the epoch of reioniza- 
tion [3-5] . The power spectrum of CMB polarization at 
angular scales greater than a degree or so is sensitive to 
inflationary model parameters such as the inflaton poten- 
tial and the energy scale of inflation, as well as primordial 
gravitational waves [6-10]. 

These exciting rewards, taken together with increas- 
ingly sensitive receiver technologies, have set the stage 
for a host of new CMB polarization experiments. These 
experiments will be faced with new and more challenging 
analysis tasks than for the simpler case of anisotropy, and 
it is with that in mind that we now set out to report the 
details of the data analysis of the POLAR experiment. 

POLAR (Polarization Observations of Large Angular 
Regions) was designed to measure the polarization of the 
CMB on large angular scales, in the Ka frequency band 
from 26 - 36 GHz. This HEMT-based correlation po- 
larimeter operated for a single season in the Spring of 
2000 near Madison, Wisconsin. The data from this single 
season led to simultaneous upper limits on E and B-type 
CMB polarization, results which were initially presented 
in [11] (hereafter KOI). The details of the POLAR in- 



strument and its operation were later described in [12] 
(hereafter K02). In this paper we present the details of 
the data selection and analysis techniques used to ar- 
rive at the results in KOI. We also discuss the results 
of a recent cross-correlation analysis of the POLAR data 
with anisotropy data from the COBE-DMR experiment, 
described fully in [13]. 

The rest of this paper is organized as follows. We 
first briefly discuss the two conventions used to quantify 
CMB polarization. We next review the basic properties 
of the POLAR instrument in §111. In §IV, we discuss 
the data selection procedures that were used to remove 
large amounts of contaminated data. §V describes the 
mapmaking algorithms used to transform the raw data 
into sky maps of Q and U , and provides a full pipeline 
simulation in order to test the algorithms. §VI presents 
the likelihood analysis used to arrive at the upper lim- 
its on CMB polarization, as well as the evaluation of the 
polarization power spectra and some commentary on the 
lack of substantial foreground contamination. Finally, in 
§VII, we discuss the limitations of our experiment that 
could be improved upon in future projects. 



II. Q AND U CONVENTIONS 

In recent years there have appeared in the literature 
two conventions for reporting the Stokes parameters Q 
and U as applied to the CMB. The first convention takes 
(5(x) = Tx{^) — Ty(i), where and Ty are antenna tem- 
peratures measured in orthogonal directions by a single- 
mode radiometer with unit optical efficiency, and x de- 
notes the angle of observation; U is obtained similarly 
by rotating the radiometer coordinates by 45°. These 
antenna temperatures are then converted to thermody- 
namic temperatures in the usual way (see e.g. [14]). This 
convention was employed by the pioneering experiments 
of Lubin and Smoot and others [15, 16], and was also 
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adopted for POLAR. 

The more recent convention (see e.g. [17, 18]) takes 
Q = (Tx — Ty)/2, and also has been adopted by several 
experiments [1, 19-21]. The resuhs herein arc presented 
using the former convention (in thermodynamic units) 
unless otherwise noted^. SpccificaUy. this means that 
our basic limits on E and B type polarization (presented 
in KOI and in §VIA of this paper) must be divided by 
a factor of two when directly comparing to experiments 
or theory employing the latter convention. We originally 
reported our simultaneous upper limits on E-type and B- 
type polarization in KOI as 10.0 fiK at 95% confidence; 
stated using the latter convention, these limits become 
5.0 iiK each. 



III. INSTRUMENT 

POLAR observed the local zenith from the University 
of Wisconsin - Madison's Pine Bluff Observatory in Pine 
Bluff, Wisconsin (Latitude +43°01', Longitude +89°45') 
using a simple drift-scan strategy, with a 7° FWHM beam 
defined by a Ka-hand microwave feed horn. A correla- 
tion radiometer operated as a polarimeter that was in- 
stantaneously sensitive to the U Stokes parameter. The 
full RF band was divided into 3 sub-bands, 32-36 GHz, 
29-32 GHz, and 26-29 GHz. Each of the channels was 
detected by a separate microwave correlator, labelled Jl, 
J2, and J3, respectively. The constant rotation of the in- 
strument about its vertical axis allowed for simultaneous 
detection of both the Q and U Stokes parameters. 

POLAR employed a phase-sensitive detection tech- 
nique. The relative phase of the two arms of the correla- 
tion radiometer was modulated at approximately 1 kHz. 
An analog lock-in circuit produced the actual "in-phase" 
polarization signal. However, the instrument simultane- 
ously locked into the chop frequency shifted by § , yield- 
ing pure noise. We obtained one of these "quadrature- 
phase channels" (QPC) for each of the three in-phase 
channels (IPC). The quadrature-phase channels were a 
good monitor of the noise in our system, and as such 
were employed in several places throughout the analysis 
pipeline. We use the terms Jli, J2i, and J3i to refer to 
the three in-phase channels, while Jlq, J2q, and J3q refer 
to the corresponding quadrature-phase (noise) channels. 

The data were sampled at 20 Hz and continuously 
recorded to files, each containing precisely 9000 samples 
(7.5 minutes) of data; these files are one of the funda- 
mental data units to be discussed throughout this paper. 
The instrument was calibrated daily with a 3- mil thick di- 
electric sheet. We did not calibrate the instrument when 
the weather was poor. The calibration was accurate to 
about 10%, based upon laboratory measurements of the 



Wc shall switch to the latter convention when discussing the 
temperature-polarization correlation results. 



dielectric sheet properties; see [22] for details. Each sec- 
tion of data is defined as the longest period of obser- 
vations between calibrations. Approximately fifty such 
sections were collected throughout the observing season, 
each some 2-24 hours in length. 

POLAR's constant rotation rate allows us to charac- 
terize its response to a polarized signal as follows: 

y{t) = Iq + C cos ujt + S sin ujt 

+ Qcos2u}t + Usm2wt + n{t) (1) 

where lu = 2tt f = 0.2055 rad s^^ was the (angular) rota- 
tion frequency. The constant offset Iq is due to coupling 
of the unpolarized total power signal into the polariza- 
tion channels via the nonzero cross-polarization of the 
instrument. This offset term was typically 10-100 mK, 
depending on the channel; during good weather its sta- 
bility was better than 0.6 mK per hour. C and S are 
signals modulated at the rotation frequency (referred to 
hereafter as signals), and can be caused by various 
types of ground pickup and other systematic effects. Via 
Equation (1), linearly polarized signals will produce a sig- 
nal in the data stream at twice our rotation frequency, 
henceforth called the 2(/) frequency. 



IV. OBSERVATIONS AND DATA SELECTION 

During the observing season, POLAR observed 24 
hours per day over a two month period, yielding roughly 
750 hours of data. However, the observing season con- 
tained a large diversity of weather conditions, and this 
led to a correspondingly large diversity of data quality. 
Developing robust data selection criteria was one of the 
most critical tasks in the data analysis pipeline. Because 
of the data's diversity, we were not able to arrive at a 
single selection criterion; rather, we developed a battery 
of conditions that the data were required to meet before 
being accepted. Many effects can conspire to contami- 
nate the polarization signals, be they instrumental, at- 
mospheric, or celestial. We wish to flag and remove any 
data with a non-cosmological contribution to our signal 
that mimics a cosmological signal in a way that we can- 
not account for and remove. 

The rest of this section will describe the various criteria 
wc established in order to robustly separate out contam- 
inated data. There were three basic types of cuts used: 
instrument-based, celestial, and data-based. Instrument- 
based cuts were those in which our systems were not func- 
tioning properly or led to unanalyzable data. Celestial 
cuts were those in which some non-cosmological celestial 
source was in a position to contaminate our data, such 
as the sun, moon, or galaxy. Data-based cuts were those 
which used noise properties of the data itself to assess 
contamination. Table 1 lists all of the cut criteria used 
in the POLAR analysis; the entries in the table will be 
described in more detail throughout the rest of this sec- 
tion. 
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TABLE I: Amounts of data cut at various steps in the data 
selection process. 



Cut Type 


Hours 


Hours 


Indiv. 




Surviving" 


Cut'' 


Hours Cuf^ 


Pull Data Set 


746.5 (100%) 


(0%) 


(0%) 


Instrument 


629.3 (84.3%) 


117.3 (15.7%) 


117.3 (15.7%) 


Celestial 


270.9 (36.3%) 


358.4 (48%) 


434.3 (58.2%) 


Data-based 


152 (20.4%) 


118.9 (15.9%) 


478 (64%) 


Length-based 


78.1 (10.5%) 


73.8 (9.9%) 


N/A 



"The amount of data left after that cut and all the cuts above it 

have been applied. 
''Amount of data cut at that stage. 

'^Amount of data that would have been cut if that particular cri- 
terion were the only one applied. 



We flagged data at the file level; that is, we either 
kept or cut individual 7.5-minute length data files. This 
proved convenient as well as ensured good noise station- 
arity over an individual file, while allowing for a good 
estimation of the low frequency noise, necessary for our 
mapmaking algorithms. A relatc;d question to the cutting 
timescale is whether or not to cut all channels simulta- 
neously versus separately. An important fact of our ex- 
periment was that some of the effects contaminating the 
data were frequency dependent; in particular, channel J3 
(26-29 GHz) often showed evidence of a spurious signal 
when the other two channels did not. For this reason, we 
performed the data-based cuts on each of the channels 
individually; the instrumental and celestial cuts had no 
frequency dependence, and hence were applied equally to 
the three channels. 



A. Instrument-Based Cuts 

We performed two instrument-based cuts. The first cut 
was to remove any files during which the system wasn't 
rotating for the entire file, or the rotation was unnatu- 
rally slow or jittery. This cut removed 1.8% of the data. 
The other issue was dew formation on the optics, a com- 
mon problem in ground-based experiments. When the 
relative humidity was high for an hour or more, moisture 
condensed on the vacuum window, leading to a spurious 
polarization signal. The mechanism was likely that am- 
bient radiation scattered off the layer of water, leading 
to the spurious signal. Dew contaminated necessitated 
removal of 14% of the data. 



B. Celestial Cuts 

Because the fcedhorn accepts some radiation from 
all angles, a celestial source will contribute a nonzero 
amount of radiation when it is anywhere above the hori- 
zon. We are concerned with celestial objects leaking ra- 
diation through a sidelobe in our beam, or the main lobe 



if the source ever gets close enough to the local zenith. 
The sun, moon, and planets, as well as point sources and 
diffuse radiation from the galaxy are all possible contam- 
inants. 



1. Solar System Objects 

We used the measured beam profile (see K02) to de- 
termine attenuation of the microwave emission from the 
sun, moon, and planets as a function of angle from the 
beam (which equals altitude for our zenith-staring exper- 
iment). Treating the sun as an 11,000 Kelvin blackbody 
with an angular width of 0.5°FWHM [23], and conserva- 
tively assuming 1% polarization, we find contamination 
from the sun is less than < QAfiK for elevations less 
than 70°. Similarly, the lunar model by [24] shows that 
the moon can be treated as a blackbody of approximately 
240 K in its brightest phase, with an angular diameter 
of 0.5° and < 1% polarization. Again assuming 1% po- 
larization, lunar emission falls below 0.05 at zenith 
angles greater than 40°. Using cut elevations of 70°for 
the sun and 40°for the moon removed a sizeable 42% of 
the data. 

In principle, contamination by the planets was also 
possible. The largest planetary signal was estimated to 
be ~ 0.01 IJ.K in total intensity. This was due to Jupiter, 
which reached a maximum of 65°elevation during our ob- 
servations. Including the fraction of the source that is 
polarized will reduce this number even further. Thus in 
the case of POLAR, planetary radiation can be safely 
neglected. 

2. Galactic Foreground Emission 

Finally there is the question of emission from the 
galaxy, either from diffuse or point sources. POLAR had 
a sensitivity to point sources of roughly 2 fj,K /Jy. Us- 
ing the WOMBAT point-source catalog for 30 GHz [25], 
we calculate that there are approximate 40 sources that 
passed through our beam contributing greater than 1 ij,K 
antenna temperature in intensity. Only three sources 
contribute greater than 10 /iK. We chose not to per- 
form a point-source based cut as detecting these sources, 
while challenging, would be both interesting and rela- 
tively easy to identify as galactic versus cosmological in 
origin. 

The observations described were conducted over a wide 
range of galactic latitudes and therefore there is a poten- 
tial for diffuse galactic contamination, especially at low 
latitudes [26, 27]. Galactic synchrotron emission can be 
up to 70% polarized [28]. No maps of polarized syn- 
chrotron emission exist at 30 GHz, and extrapolation of 
measurements at lower frequencies, e.g. [29], is not a re- 
liable probe of synchrotron polarization at 30 GHz due to 
Faraday rotation. Although the unpolarized intensity is 
apparently not correlated with the polarized intensity as 
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(a) 



FIG. 2: 2(f>r vs 14>r for the data channel J2i. Periods con- 
taminated by sun, moon, and dew have been removed. The 
vertical dashed line shows the l<^r < 2.1 cut level. 
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(b) 

FIG. 1: Low frequency power spectrum of two sections of 
data for channel J2i. Panel (a) shows a power spectrum of 
the time stream when the weather is good and systematic ef- 
fects are low. Panel (b) displays a similar period when clouds 
appeared; there is a dramatic appearence of spikes at both the 
10 and 2(/) frequencies. This motivated our using the strength 
of the l(f> peak as a cut statistic. True celestial polarization 
signals will appear at the 20 frequency only. 



shown in [13], we attempt to limit our susceptibility to 
synchrotron emission by only using data corresponding 
to Galactic latitudes |6| > 25°. 



C. Data-Based Cuts 

Together, the instrument-based and celestial cuts re- 
moved approximately 64% of the original 750-hour data 
set. Cursory analysis, however, immediately revealed 
that spurious signals still remained in this reduced data 
set at unreasonable levels. This is not surprising given 
that the weather in Wisconsin is highly variable in the 
spring, and most of the radiation entering the system 
was atmospheric emission. Especially important was the 



presence of clouds, which can mimic a polarized signal 
to our radiometer [30]. We therefore follow the standard 
practice of deriving selection statistics based upon the 
data itself, to serve as a measure of the quality of observ- 
ing conditions. 



1. The 10 Cut 

Signals modulated only at the 2(f> frequency correspond 
to true polarization signals; thus, a signal that has re- 
sponse at both 1^ and 2^ cannot correspond to a true 
celestial signal. This can be seen in the power spectrum 
of the data (Figure 1). The upper (a) panel shows a 
roughly featureless power spectrum taken during a pe- 
riod of good weather for channel J2i. But as the weather 
worsens, due to clouds and/or increased humidity and 
water vapor, features at the and 20 frequencies appear 
(Figure 1(b)). This motivates a study of the correlation 
between 1(f) and 20 signals in the time stream. 

We computed the 10 and 20 values for each data file 
relative to the noise floor^, denoted as 10^ and 20^ re- 
spectively. Figure 2 displays these data plotted against 
each other for channel .J2i, with both instrument-based 
and celestial cuts applied. The high degree of correla- 
tion between the two quantities is striking; this strongly 
motivated the use of the 10^ level as a cut statistic. In 
order to determine a logical 10^ cut level, we compared 
our data to the behavior of white noise. Monte-Carlos 
showed that this statistic is 1.0 ±0.27 for white noise, dis- 
tributed roughly as a Gaussian. We opted to cut when- 
ever 10r > 2.1; this is more than 4cr from the mean for 
white noise. Implementing this requirement cut an addi- 
tional 15% of the data. However, even after instituting 
this cut there remained unphysically high values of 20^ 



^ We found it useful to compare the height of these harmonics rela- 
tive to the noise floor, to remove the eflfect of vajriable radiometer 
noise. 
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FIG. 3: Distribution of the Variable. The solid (black) curve 
shows the distribution of ( for channel J2, with the basic sun, 
moon, and dew cuts applied. The dotted (blue) curve shows 
the ("-distribution for simulated white noise run through our 
anti-aliasing filter. The vertical dashed curve is the cut level 
of C=4.0. 



in the data, motivating additional cuts. 



over the other cuts. Varying the actual cut level between 
C < 2 and C < 5 had little effect on the final results. 



3. Outliers in the Time-Ordered Data 

Occasionally, birds, planes, electrical noise, etc., would 
cause short-lived yet large spikes in the data stream. For 
each data file and channel, we calculated the mean and 
standard deviation, and recorded how many standard 
deviations the first, second, and third strongest outliers 
were from the mean. We cut any data file whose second 
largest outlier was more than 5cr from the mean of that 
data. Data files with one strong outlier alone rarely oc- 
curred, because our 5 Hz filters in conjunction with our 
20 Hz data acquisition frequency guaranteed that even 
a delta-function signal should have a width of at least a 
few samples in the data stream; thus the second strongest 
outlier proved a more robust indicator for such spurious 
effects. This cut removed ~ 1.2% of the data in addition 
to all previous cuts. 



2. The Zeta Cut 

Because the autocorrelation function and power spec- 
trum of any data set form a Fourier transform pair, the 
information in one is the same as in the other. For in- 
stance, a rise in 1 /f noise leads directly to a higher "floor" 
in the autocorrelation function. Any signal in the data 
will lead to a non-trivial autocorrelation function. We 
used that fact to our advantage and defined the follow- 
ing statistic for each data file: 

ElOOO ^, \2 
^ lag=l^\y^n) 

Z^lag=l ^ [I/quad) 

where yi„ denotes data from an in-phase channel and 
Vquad denotes data from the corresponding quadrature- 
phase channel. Simulations showed that a reasonable as- 
trophysical signal (of, say, less than 100 /iiiTrms), would 
have a negligible contribution to C, for POLAR. Equation 
(2) has an intuitive explanation; it is roughly the inte- 
gral of the power spectrum between 0.01 Hz (near our 
lowest observable frequency) and 10 Hz (our Nyquist fre- 
quency), weighted by 1//, so low- frequency drifts cause C, 
to increase rapidly. As such drifts are generally indicative 
of poor atmospheric conditions, C proves to be a sensible 
cut statistic. 

Figure 3 shows a histogram of C values throughout the 
season (with the basic sun, moon, and dew cuts applied). 
Also shown in the plot is a model of our data stream with 
no 1/f noise or signal of any kind. As can be seen, this 
distribution is peaked around 1.0, with more than 99.9% 
of the data lying below 1.2. However, any deviation from 
white noise will rapidly change the C value of the data. 
We found that requiring C < 4 cut about 3% of the data 



D. Duration-Based Cuts 

In order to ensure that our surviving data segments 
were obtained during long, contiguous periods of good 
weather, we instituted a battery of three duration-based 
cuts. First, we required that if a data file were to sur- 
vive, both its neighbors had to survive as well. Second, 
we required a minimum of eight consecutive data files 
(one hour of data) to survive in order to keep any piece 
of data. Lastly, we required a minimum of three hours 
total to survive in a section of data for any data in 
that section to be retained; in contrast to the cut above, 
this three hours was not required to be contiguous. This 
cut was instituted because of the offset removal step of 
the analysis pipeline, described in §VD. This algorithm 
takes place on a per-section basis, and effectively removes 
all information from very short sections of data; in the 
end we found it simpler just to remove these short sec- 
tions of data. Together, the duration-based cuts removed 
about 10% of the full data set. 



V. FROM DATA TO MAPS 

We now proceed to construct Q and U maps of the 
post-cut data set, a process which does not remove any 
cosmological information [31]. Many authors have writ- 
ten on the mapmaking problem and all the nuances that 
can arise during its solution: Wright (1996) [32]; Tegmark 
(1997), hereafter T97a [31]; Bond, Jaffc, Knox (1998), 



A section of data is a data segment that was taken without in- 
terruption (this was typically about one day of data). 
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hereafter BJK [33]; and Stompor et al. (2001), hereafter 
SOI [34]. We use the following notational eonventions. 
When discussing a vector, we use a lowercase boldface 
letter {e.g. , y) to represent it. Similarly, matrices are 
represented by uppercase boldface letters {e.g. , W). 

Wc; employ the technique of Minimum Variance Map- 
making throughout this work, defined briefly as follows; 
the interested reader is referred to T97a for a thorough 
discussion of this topic. We begin with a set of n ob- 
servations y = yi . . .yn of signal plus noise, such that 
y = Ax + n, where x is the underlying map, n repre- 
sents noise, and A is the pointing matrix that encodes 
the mapping from data space to map pixel space (see [32] 
for examples). Let N represent the data noise covariance 
matrix, where N = (nn*). The best estimate x of the 
true map x is given by 

X = [A*N-iA]^^A*N-i y (3a) 
S = [A*N~iA]-\ (3b) 

where S is the map noise covariance matrix of x. Note 
that the total map covariance matrix, C, is the sum of 
signal plus noise: C = S + S, where S = (xx*) is the the- 
ory covariance matrix. The total covariance matrix will 
be used in §VI. This method is quite general, and can be 
applied to any type of problem where a linear combina- 
tion of data is made in order to determine some physical 
quantity. Whenever the noise of said data isn't white, 
this is the best approach. For POLAR, we exploit this 
technique no fewer than four different times throughout 
the analysis pipeline. 



A. Constructing the File-Maps 

The data pipeline begins with binning the data into 
our map coordinates, via Equation (3). We formed a 
small file map for each 7. 5- minute (9000-samplc) data 
file, for each channel and in both Q and U. The "map" 
we calculated for each data file consisted of the Q and U 
parameters for whatever map pixels were observed during 
that particular file. Because of our scan strategy, a zenith 
drift scan at 6 = 43°, our maps arc one-dimensional in 
right ascension (for each of Q and U). We pixelized our 
maps with 180 pixels of width 2° in RA, with the first 
pixel arbitrarily centered at RA = 0° . The pointing ma- 
trix A was formed according to Equation (1); note that 
this matrix is not sparse, and is significantly more com- 
plicated in the case of extracting both Q and U rather 
than the usual intensity T (see [30] for further details). 

We performed binning on each data file separately be- 
cause our noise was only stationary over periods of tens 
of minutes. In principle, even nonstationary noise can 



* This estimate of x is typically termed the "COBE-style solution" 
[32, 35]. 



be treated with the standard mapmaking formalism, but 
the algorithms arc unacceptably slow because the covari- 
ance matrix N has no special properties except that it is 
symmetric, and the central rate determining step of the 
mapmaking algorithm of Equation (3) is in calculating 
N"-"^. For stationary noise, however, the corresponding 
noise covariance matrix has the special property that it 
is both symmetric and Toeplitz; that is, each upper left- 
lower right diagonal is the same. Toeplitz matrices can 
be inverted in O {nf) time, as opposed to O {nf) for gen- 
eral rit X Ut matrices [36] . However, the memory require- 
ments are still O {nf) because the inverse of a symmetric 
Toeplitz matrix is not in general Toeplitz. For POLAR, 
Ut = 9000, so simply holding the inverse matrix takes 
~ 324 MB of memory for double-precision arithmetic. 
While not prohibitive, this memory requirement moti- 
vates further study of approximation techniques. 

Wc use a slight modification of the circulant matrix 
approximation introduced in [37], which notes that a typ- 
ical noise covariance matrix N is nearly circulant. In this 
case, N = Ng-I-Ns, where Nc is the circulant component 
of N and Ng is the non-circulant component. As is 
the major contributor to N, [37] lets N — > Nc in Equa- 
tion (3a); the equation for the covariance matrix becomes 
significantly more complicated. This solution is valid in 
that it still produces an unbiased, nearly-optimal approx- 
imation of the map, at the expense of slightly increased 
noise. It has the advantage that circulant matrices can 
be inverted in O {nt In rij) time. 

However, it is possible to enact the circulant matrix 
approximation in a more streamlined way. Using the 
definitions that 

y^^ ^ N-i/2y (4a) 
Ap^ = N-i/^A (4b) 

, the circulant matrix approximation becomes 

X = [Ap^Ap^] A-py^Ypyj (5a) 
S - [A*^Ap^]-i . (5b) 

The second equation above is not precisely exact, but 
for our data it was correct to better than 1%. We state 
the mapmaking algorithm in this fashion because there 
is a very fast way to calculate yp,„ and Ap^,. Because a 
circulant matrix becomes diagonal in the Fourier domain, 
it can be shown that 

bp^ ^ N-V^b = l^i^l (6) 

where b is any vector of length rit, Sn is the power spec- 
tral density that characterizes the covariance matrix N, 

and ".7^" denotes the Fourier Transform. This approach 
is advantageous in that it requires only a simple fit to the 
noise power spectrum for the time stream of interest; it 
is unnecessary to explicitly calculate the resulting covari- 
ance matrix N. The resulting vector bp,„ is simply the 
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FIG. 4: Effect of prewhitening on a sample noise power spec- 
trum for channel J2i. Panel (a): PSD amplitude for a sample 
data file, with a knee frequency of ~ 0.07 Hz. The thick (blue) 
curve is the logarithmic- weighted fit to the PSD. The verti- 
cal dashed lines show the Icp and 2(j} rotational frequencies. 
Panel (b): Same as (a), but for the y^uj version of the data. 
The effect was to whiten the power spectrum of the data; it 
is equivalent to dividing the PSD by the fitted curve. 
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Day Number 

FIG. 5: The mean values of Q and U for the POLAR in-phase 
data versus time, after the cuts have been applied. Each 
data point represents one data file. Non-Gaussian behavior is 
immediately apparent at the 50-100 fiK level. This behavior 
was not seen the quadrature-phase data. 



prewhitened version of b. We fitted our noise to a model 
containing 1/f noise alone: 

SN{f)^<T'[l + ^^ , (7) 

where fknee is the knee frequency of the 1/f noise. It 
proved unnecessary to include the anti-aliasing filter in 
the model, as it has virtually no eS'ect on the power spec- 
trum at the 2(j) frequency, where the Q and U data live. 
We used Equation (6) to then calculate and Ap^ ^ 
for all data files passing the cuts, and applied Equation 
(5) to generate roughly one thousand 1-2 pixel maps in 
each of Q and U, along with the corresponding covariance 
matrices. 



three channels. This is in contrast to the QPC data, 
which were offset-free. Thus the offset was not electronic 
in nature, and also was not consistent with a celestial 
signal when plotted in sky coordinates. 

The underlying cause of the offset was never discov- 
ered, but there are several reasonable possibilities. K02 
hypothesizes that the quadrupolar shape of the outer 
groundscreen may be to blame [12]. [30] examines the 
characteristics of the offsets in great detail, and provid- 
ing several theories for the possible cause. If we had 
some model of the effects that could make successful pre- 
dictions about its level, we could use it to safely subtract 
out the offsets without strongly affecting our signal re- 
covery. However, without such a model we proceed along 
a different tack. 



B. The problem of the Q and U Offsets 

At this point in the analysis, a serious issue revealed 
itself. The Q and U data were not in general consistent 
with zero; there were slowly-varying offsets in Q and U 
that varied over the course of the season, and differed be- 
tween the channels. This fact is illustrated in Figure 5, 
which shows the derived time stream of Q and U values 
for all the surviving data. Notice first that the data lie 
in "chunks" along the time axis; each such chunk corre- 
sponds with a section as defined in §111. The distribution 
of Q's and J7's for each section is consistent with a Gaus- 
sian distribution, but those distributions are in general 
not centered around zero. Notice also that there is no 
clear relationship between the Q or U offset among the 



® Equation (6) was applied to each to each row of A to form the 
corresponding row of A.pni- 



C. Submap Generation 

It is a simple task to combine all the file maps for a sin- 
gle channel and Stokes parameter, within a single section 
into a submap (each corresponding roughly to one night 
of good data) . Given a set of file maps to be combined, we 
first expand them to the full 180 pixel map by assigning 
unmeasured pixels infinite noise (and hence zero weight). 
The correlations between Stokes Q and U were negligi- 
ble both within a pixel and between pixels, and were 
therefore neglected. This allowed Q and U to be treated 
completely independently throughout the analysis. We 
use the standard map combination prescription to com- 
bine the file maps into submaps (see e.g. , de Oliveira- 
Costa et al. [38]). Given a set of m independent maps 
{xi . . . Xm} with corresponding noise covariance matrices 
{Ni . . . Nm}, the best estimate of the underlying map 
X f and its associated covariance matrix N f are given by 
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TABLE II: Inter-Channel Cross-Correlation Coefficients 



N 



/ 



i=l 



.1=1 



(8a) 



(8b) 



In this way, we combined all file maps into submaps for 
each section and channel. At this step each submap 
was still left with a substantial offset in Q and U, as 
discussed previously. 







/TTTT\ 


/r»TT\ 


/TTr»\ 


(Jl J2)ipc 


0.144 ± 0.034 


0.134 


-0.024 


0.021 


(Jl J2)qpc 


0.005 ± 0.034 


0.062 


-0.011 


-0.002 


(Jl J3)iPc 


0.074 ± 0.041 


0.063 


-0.069 


-0.003 


(Jl J3)qpc 


0.023 ± 0.041 


0.048 


-0.009 


-0.042 


(J2 J3>ipc 


0.104 ± 0.041 


0.093 


-0.024 


-0.029 


(J2 J3)qpc 


0.066 ± 0.041 


0.001 


-0.054 


-0.047 



"The errors are the same within each row, and we assume that 
the underlying distribution of correlation coefficients is Gaussian. 



D. Marginalization and Submap Combination 

We next employed the technique of marginalization to 
each such submap [33], in order to remove sensitivity to 
the mean of Q and U in each. We performed the algo- 
rithm separately for each channel. Note that this tech- 
nique provides identical results to those obtained using 
the method of virtual pixels for mode removal, described 
for instance in [37]. Detailed recipes for applying the 
marginalization technique are described in SOI [34]; we 
give a cursory review here as to how it was specifically 
applied for POLAR. Given a submap y (containing Q or 
U data) with covariance matrix N, the covariance matrix 
is transformed to 

N' = N -h cT^ ZZ* (9) 

where the matrix Z contains a normalized column vector 
corresponding to each mode to be removed, and Cc is 
some large number to ensure the modes get zero weight. 
For marginalization over the mean we take Z = eo, where 
eo is the column vector of all I's. Taking the limit as 
cr^ — > oo, the inverse of N' still exists and is given by 

N'-i = N"^ - (N-iZ)[Z* N-^Z]-^(N-i Z)* (10) 

Wc then applied Equation (8) to combine all the 
submaps into a single map of Q and U for each channel, 
for both the IPC and QPC data. This recipe only re- 
quires knowing the inverse covariance matrix correspond- 
ing to each submap, given explicitly by Equation (10). 
However, we still have to perform the final inversion of 

m 

S-i^^N'-^ (11) 

i=l 

to find the final noise covariance matrix S, and is 

singular due to our marginalization over the means of 
all the submaps. Following SOI, the the final covariance 
matrix is taken to be 

S = (S-i eZZ*)-^ - e-^ZZ* (12) 

where e is any small positive number. In practice, it is 
best to choose e to be of the same order as the nonzero 
eigenvalues of 



The final maps for each of our three frequencies were 
constructed according to the recipe given above and pre- 
sented in KOI. Qualitatively, there is no visual evidence 
of a common signal among the three sub-bands, in ei- 
ther Q or U. The values from each map are also 
not consistent with a statistically significant signal. Let 
us additionally consider the corresponding maps made 
from the QPC (pure noise) channels. We do not expect 
these to contain signals, but they serve as a useful lit- 
mus test when viewing the IPC maps: if the IPC maps 
differ strongly from the QPC maps, that is evidence of 
either signal or some type of contamination in the IPC 
maps. However, that is not the case; none of the QPC 
maps contain strong outliers, and all exhibit values 
consistent with no signal. 



1. Combining the Channel Maps 

Based on the fact that there is no apparent correla- 
tion between the maps of each channel, we simplify the 
analysis by combining the three channel maps into one 
combined map (and covariance matrix) for each of Q and 
U. Since the CMB signal is independent of frequency 
(when expressed in thermodynamic units), we are free to 
do this. We would like to employ the standard map co- 
addition algorithm of Equation (8) to perform this task. 
However, that algorithm assumes that the measurements 
made of each individual map arc independent: if there is 
a systematic effect that introduces correlations between 
measurements from different channels, then we have less 
information than we think we do, and this fact must be 
taken into account in constructing the summed map. 

We first evaluated the correlation coefhcients among 
channels Jli, J2i, and J3i in the time-ordered data. As 
expected, the correlations in the time stream were on 
the order of 1%, and were due to the very slight overlap 
of the three channels in frequency space (see K02 for 
details). However, it is not the time stream correlations 
that we so much care about, it is the correlations between 
Q OT U for the channels. For instance, if there were a 
10% correlation between Jli-Q and J2i-Q, it could be 
hidden in the smaller time stream correlations. We must 
therefore evaluate these correlations directly. 
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In order to measure these correlations, we used the Q 
and U data set and determined the Pearson's correlation 
coefficient in the same way as for the time-ordered data, 
but because there is so much less data, we evaluated only 
one correlation coefficient for each surviving section. We 
calculated means and errors of these coefficients by av- 
eraging from the distribution of these values, shown in 
Table II. The numbers in this table are very suggestive. 
For instance, {Q Q) for all IPC channels is about the 
same as {U U), suggesting a common source. All QPC 
correlation coefficients are consistent with zero, as are all 
correlations of the {Q U) variety ^. As Q and U show 
no correlation between them (either within a channel or 
between channels) , we can continue treating Q and U as 
completely independent measurements. This is not too 
surprising, considering they are essentially the sin 2(j) and 
cos 2(j) projections from each rotation, which are orthog- 
onal functions. However, the correlations between IPC 
channels (for the same Stokes parameter) are ~ 10%, so 
we cannot ignore them in constructing a final map. 

Let us now combine the maps of Q (or U) from the 
three non-independent channels, armed with the knowl- 
edge of their mutual correlations. The algorithm of §VC 
did not deal with adding non-independent maps together, 
but it is relatively easy to expand the methods to do so. 
We treat Q and U separately, as they are completely un- 
correlated. Let us consider our situation for Q [U will 
follow an identical format). 

We have three channel maps, call them xi, X2, and 
X3, with their corresponding covariance matrices Si, S2, 
and S3. Let the correlation coefficient between and 
Xj be pij. We appeal to Minimum Variance Mapmak- 
ing to form the best possible map. First, we form the 
concatenated map and corresponding covariance matrix: 



5^ cat — 



{X1,X2,X3} 

Si /312S12 /313S13 

P12S12 S2 P23S23 
P13S13 P23S23 S3 



(13a) 
(13b) 



where S^j = 



Si-^/Sj. We can take the square roots of 
the S-matrices since they are all positive definite, as long 
as we add a large offset to each matrix (corresponding to 
the uncertainty in the offset, which is formally infinite)^. 
The final full covariance matrix, Sj, will then also have a 
large offset, but again this corresponds to our marginal- 
ization over the means of all the component submaps and 



Except perhaps {Qi U3), but because all the other coefficients of 
this type are consistent with zero, we assume it is an outlier. 
For a symmetric, positive-definite, n x n matrix M, its square 
root is given by P*D^/^P, where P is an n x n matrix such that 

the i"^ row of P contains the i**^ eigenvector of M, and D^/^ is 
a diagonal matrix with the square-roots of the eigenvalues of M 
along its diagonal. The eigenvectors must be normalized, such 
that PP* = I. 



is will not affect our final results. 

We now apply the Minimum Variance Mapmaking for- 
malism to the concatenated map (which in this case is our 
data vector y from Equation (3)). The pointing matrix 
is given by 



(14) 



where I„ is the n x n identity matrix, and n is the num- 
ber of pixels in our maps. This matrix points our three 
individual maps to the same final map. Explicitly, the 
final joint map and covariance matrix are given by: 



Sf 



(15a) 
(15b) 



Because of the large offset each covariance matrix pos- 
sesses, the final map x/ may have some random offset 
to it, but it is meaningless, and can be safely subtracted 
out. Notice that in the case of vanishing inter-map corre- 
lations, we correctly reproduce the algorithm of Equation 

The final joint- maps for the IPC and QPC are shown in 
Figure 6. There is no obvious evidence of an underlying 
sky signal. In order to determine robustly if they show 
evidence of a signal, we employ the standard techniques 
of maximum likelihood analysis and quadratic estima- 
tors, described fully in §VI. 



E. Testing the Mapmaking Pipeline 

We next simulated the data stream in order to test 
our map reconstruction pipeline. There were three pri- 
mary steps involved in the simulation process: build the 
underlying map, let POLAR "observe" this fake sky and 
generate data based on these observations, and construct 
maps from the resulting data set. We built the under- 
lying sky maps out of simple sine and cosine modes, for 
both total intensity (/) and Stokes Q and U. We as- 
sumed a basic flat band-power model with ^10 fiK per 
band, and convolved these signals with our 7° (FWHM) 
Gaussian beam. This signal level is of course highly un- 
realistic for true CMB polarization; we used it merely so 
we could reconstruct the actual map with good signal- 
to-noise (instead of merely providing an upper limit). 

To shorten processing time, we made the simulated in- 
strument about a factor of two more sensitive than the 
real POLAR. We included all IPC and QPC channels 
in the analysis, but no total power channels. We as- 
sumed the noise was almost white, with a small amount 
of 1/f noise in each channel in agreement with the true 
data set. We convolved each data stream with our 5 Hz 
anti-aliasing filter to ensure a resulting power spectrum 
that was similar to the actual data. We also added ran- 
dom offsets in /, Q, and U for each section and channel. 
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of levels consistent with those experienced by POLAR. 
The results of these simulations showed that our map re- 
construction algorithms worked as expected, even when 
including large random offsets for each section in the Q 
and U data, and illustrates the amazing utility of the 
marginalization technique. The Q and U reconstruction 
of one such "fake sky" is shown in Figure 7. 



VI. CONSTRAINING CMB POLARIZATION 
MODELS 

A. Limits on E and B in a Flat Band-Power Model 

1. Primary Likelihood Analysis 

We now address the question of the level of rms polar- 
ization in the joint-channel maps using a standard likeli- 
hood analysis. For the beginner, an excellent discussion 
of how to carry out such an analysis is provided in [39] . 

As POLAR measured both Stokes Q and U simulta- 
neously, we are able to set limits on both E- and B-type 
polarization independently. As described previously in 
KOI, we constrain a flat CMB power spectrum charac- 
terized by two polarization temperatures Te and Tb such 
that l{i + l)Cf /27r = r| where X G {E, B). We then 
solve for the likelihood function £(a), given by 



L{a) cx 



(a)3( 



|C(a)|V2 



(16) 



where a is our parametrization vector a = {Ts, Tb}, x = 
{q, u} is the concatenation of the joint channel maps of 
Q and U given in the previous section, and C(a) is the 
full covariance matrix given by the sum of the data (or 
noise) and theory covariance matrices: 




100 150 
Right Ascension [deg] 



200 



FIG. 7; Derived joint-channel maps from simulated data for 
one example "fake-sky" map. The thin (black) curve is the 
underlying sky map convolved with a 7° beam. The (blue) 
data points represent the derived joint-channel map, after the 
31 hours of simulated data in 5 submaps. The good agreement 
illustrates the robustness of our mapmaking algorithms and 
of the marginalization technique used to remove sensitivity to 
means of Q and \J from each contributing submap. 



C(a) = S(a) 



(17) 



The construction of the theory covariance matrix S is 
rather complicated [7, 40, 41], and has been previously 
presented for POLAR [11, 30]. The data covariance ma- 
trix S is formed from the covariance matrices for the 
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FIG. 8: Plots of the RA = 40° row of the fundamental sig- 
nal covariance matrices and S^. The four panels in the 
plot correspond to the four quadrants of the matrices as la- 
belled. The thick solid line represents S^, the thick dashed 
line represents S^, and the thin solid line corresponds to a 7° 
Gaussian beam for reference. 




FIG. 9; Normalized likelihood plot of Te, with the prior con- 
straint that Tb ~ 0. The solid line is the result for the in- 
phase channels, and the dashed line is for the quad-phase 
(null) channels. The resulting upper limit is Te < 7.7 at 
95% confidence. 



joint-channel maps of Q and U such that 



Sq 




(18) 



Note that S has a very large offset (and corresponding 
eigenvalue) due to our original marginalization over the 
submap means, but its specific magnitude will not affect 
results of the likelihood analysis. 

We calculated the likelihood function for all individ- 
ual and joint channel maps, both for the in-phase and 
quadrature-phase channels. As reported in KOI, all cases 
were consistent with pure upper limits. The combined- 
channel data yield 95% confidence limits of 10.0 on 
both Te and Tb (or 5.0 stated using the modern 
convention for defining Q and U) . As the _B-polarization 
at large angular scales is assumed to be so much weaker 
than i?-polarization (even taking into account lensing), 
we can can set Tb to be zero. It yields a 95% confidence 
limit of Te < 7.7fiK, as shown in Figure 9. 



2. Alternative Likelihood Analyses 

To verify the analysis, we repeated it in two different 
ways. First, we performed the likelihood analysis without 
the offset removal. The corresponding likelihood analy- 
sis yields a spurious detection for the in-phase channels 
at {Te,Tb) = (18, 1) ± (8, 16) ^iK (95% conf.), while the 
quadrature channels yield a 95% confidence upper limit 
of 7 for both Te and Tb- We see that the offset 
removal degraded the upper limit by ^ 30%. This is 
primarily due to the large width 25°) in right ascen- 
sion of the central lobe of the theory covariance matrix, 
as shown in Figure 8, coupled with our marginalization 
over the mean of each submap. This is consistent with 



the fact that each submap averaged roughly 90° of cov- 
erage in right ascension. This emphasizes to the experi- 
menter that having long, clean sections of data is key to 
obtaining the best final noise possible, especially if any 
kind of mode removal may take place in the analysis. 

As a further consistency check, we performed the en- 
tire analysis in such a way that the three channels were 
co-added directly in the time-ordered data. This tech- 
nique automatically takes into account any correlations 
that may be present between the channels. One simply 
co-adds the time streams with their inverse noise weight- 
ings in order to obtain a time stream with the minimum 
possible noise. Offset marginalization was still done on 
the submaps, but since we had already combined all the 
channels, a single offset was removed for Q and U for each 
section, rather than one for each channel. When the cor- 
responding likelihood contours were calculated, the up- 
per limits remained, but were degraded to about 12 ^.K. 
This makes sense considering we subtracted only two off- 
sets per section, as compared with six in the primary 
analysis. Because the offsets were not perfectly corre- 
lated among the three channels, there was residual power 
left over in the maps due to the imperfect co-addition of 
the channel offsets; it was this phenomenon that led to 
the slightly degraded upper limits. This result notwith- 
standing, this analysis is noteworthy in that it shows our 
inter-channel correlations were not a severe problem. 



B. Limits on Polarization Power Spectra 

CMB anisotropy and polarization are typically char- 
acterized by the temperature fields T, Q, and U . These 
quantities can be combined to form six measurable power 
spectra: TT, EE, BB, TE, TB, and EB (see for in- 
stance [41]). Our analysis of these spectra is presented 
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Table III: POLAR-DMR Power Spectrum 

4ff ± 51 5T^ ± a [nK'^] ST [fiK]^ 
487.0±270.6 22.lj;Jj 

-4.9± 16.0 < 3.3(5.2) 

6.9± 16.0 < 4.8(6.3) 

-18.4± 34.3 < 4.0(7.1) 

-0.1± 34.3 < 5.9(8.3) 
-25.0± 15.8 < ( 2.6) 



(a) 



T 15.6±6.6 

E 12.6±4.5 

B 12.6±4.5 

X 14.0±4.8 

Y 14.0±4.8 

Z 11.4±2.9 



'■"'All values reported using the convention Q = (T^ — Ty)/2, 
as discussed in §11. Table reproduced from [13]. 
'■'''Values in parentheses are 2-a upper limits. 



in [13]; we give a brief summary of it here. Note that, 
in contrast to the rest of this paper, temperatures in 
this section are reported using the modern convention 
of Q — [Tx — Ty)/2, as discussed in §11. 

As POLAR had very weak sensitivity to temperature 
anisotropy data, we used the COBE-DMR data for the 53 
GHz and 90 GHz bands averaged together as discussed 
in [13]. We analyzed the spectra in five bands of width 
i — but subsequently averaged them into a single band 
to increase the signal-to-noise ratio. We employed the 
method of quadratic estimators to evaluate the band- 
powers (see e.g. [41, 42]), using the concordance model 
iP'Lambda ^ 0.7, Vlmat ^ 0.3, h ^ 0.7) as iuput for the 
theory covariance matrices [43]. All band-powers were 
consistent with upper limits, the results of which are 
shown in Table 6. Note that the 2a upper limits on E 
and i3-type polarization obtained with quadratic estima- 
tors are 5.2 and 6.3 ^lK respectively, in good agreement 
with the 2(7 upper limits of 5.0 fiK obtained from the 
maximum likelihood analysis. 

This technique has the added benefit of explicitly cal- 
culating the band-power window functions, while for a 
maximum likelihood analysis, they are less straightfor- 
ward to evaluate [44]. However, there is a new twist 
on band-powers when it comes to polarization; there is 
always some amount of leakage into the desired power 
spectrum from the other five power spectra. In principle, 
it is possible to choose priors for the quadratic estima- 
tors such that fourteen of the fifteen leakages are zero, 
but the much discussed E-B leakage remains [41], al- 
though it can be kept to a minimum at the price of a 
modest increase in error bars. This leakage is a direct re- 
sult of "ambiguous" modes existing in the scan strategy; 
the more ambiguous modes, the worse will be the leak- 
age [45]. Sky coverage that is large, two-dimensional and 
well-connected will have few ambiguous modes; but PO- 
LAR's one-dimensional scan strategy leads to virtually 
all of the E and B modes being ambiguous. 

Figure 10 shows the E window function for a single 
£-band for 2 < ^ < 8 for POLAR as an example; the 
window function was generated using the minimum leak- 
age techniques discussed above. The leakage of B into 
E is exactly symmetric for E into _B, thus we show only 



J 

J 


L 

i 




















B 






10 


20 


30 



Multipole 1 

FIG. 10: The window function for E used to estimate the 
polarization band-power near £ — 5, shown as a representative 
example. The B window functions are exactly the same as for 
E, but with E and B switched. There is significant leakage of 
B-power into the iJ-estimate because of our one-dimensional 
scan strategy. In general, a scan done over a two-dimensional 
region with a large sky coverage will have a much better E-B 
separation and narrower window functions. The width of the 
window function scales with the inverse of the sky patch size 
in its narrowest dimension. 



the E window function. It is clear that there is a very 
large leakage from B to E and vice versa. Therefore, the 
upper limits for E and B cannot be taken separately; we 
constrain merely the average of the two power spectra. 

An early reionization leads to a peak in both the 
EE auto-correlation and TE cross-correlation spectra at 
large angular scales [3, 46]; this effect was observed re- 
cently in the TE power spectrum by WMAP [2]. Un- 
fortunately, our EE and TE upper limits are still signif- 
icantly higher than any possible reionization peak. For 
T = 0.4, a typical concordance model gives STee ^ 1 
and STte ~ 3 ^K, in comparison with our limits * 
of 5.2 fiK on STee and 7.1 fiK on STte- In comparison, 
WMAP recently reported r = 0.17 ±0.04 (68% conf) [2]. 



C. Constraining Polarized Synchrotron Emission 

Astrophysical synchrotron emission is generally polar- 
ized at a level from 10-50% [27, 29]. Our non-detection of 
any signal implies that we can place constraints on polar- 
ized synchrotron at 30 GHz, but this is not as straightfor- 
ward as it may seem. As discussed in §IVB 2, we cannot 
simply extrapolate from low frequency polarization mea- 
surements such as Brouw and Spoelstra [29] because of 



* Stated using the convention Q = (Tx — Ty)/2 as discussed in §11. 
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Faraday depolarization. Extrapolation of the unpolar- 
izcd Haslam 408 MHz data [47] to our frequencies using 
a speetral index of -3.1, assuming a polarization fraction 
of 10%, and smoothing with a 7° beam yields an rms 
of about 10 nK'^ . Thus, our data favor both a steep 
spectral index as suggested by recent studies [48] and a 
polarization fraction of less than 10% at large angular 
scales. This issue is explored more fully in [13]. 

VII. DISCUSSION 

To summarize our results thus far, using the POLAR 
data we have placed limits on CMB polarization power 
spectra in the context of both a flat band-power model 
and a concordance model, and we have discussed the lack 
of substantial synchrotron contamination in our scan re- 
gion. We showed in §VIA that the offset removal tech- 
nique only degraded our limits by ~ 30%, which is not 
so bad considering that without it, no limits would have 
been possible. 

If we had integrated for a much longer time and/or 
our detectors had been much more sensitive, how well 
could we have actually done in determining E and B 
on large angular scales given our scan strategy? We at- 
tempted to answer this question via an additional simu- 
lation. We used CMBFast [49] to generate power spectra 
for a concordance model imiverse with no i3-modes (i.e., 
only scalar perturbations) and an optical depth to reion- 
ization of t = 0.2; ideally we should recover only E modes 
from the analysis. We used the HEALPix package [50] 
to generate twenty sky realizations of Q and U, and ob- 
served each with a nearly perfect POLAR, one that had 
^ Stated using the convention Q = Tx —Ty as discussed in §11. 



a sensitivity of 0.01 ^iK for each of 180 2° wide pixels in 
our 6 — 43° RA strip. We then performed our likelihood 
analysis on these map realizations limited to our scan re- 
gion, and found that we recovered roughly equal amounts 
of E and B power. This is consistent with the window 
functions generated in our quadratic estimator analysis 
of §VIB. On a positive note, we did recover the correct 
amount of total E and B power; that is, the recovered 
Cf + Cf equaled the initial Cf placed in the maps. 

Ultimately, POLAR was limited both by its instru- 
ment sensitivity and atmospheric limitations to integra- 
tion time. The atmosphere was also most likely responsi- 
ble for the time- varying polarization offset. Finally, our 
one-dimensional scan strategy prevented discrimination 
between E-modcs and B-modes. Future experiments that 
effectively deal with these issues may well be able to glean 
the CMB polarization signals on largo angular scales, and 
hence uncover a wealth of new information with which to 
increase our understanding of inflation and the details of 
the Big Bang. 
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